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ABSTRACT 

We present a statistical analysis of voids in the 2dF galaxy redshift survey (2dFGRS). 
In order to detect the voids, we have developed two robust algorithms. We define voids 
as non-overlapping maximal spheres empty of halos or galaxies with mass or luminosity 
above a given one. We search for voids in cosmological iV-Body simulations to test 
the performance of our void finders. We obtain and analyze the void statistics for 
several volume-limited samples for the North Galactic Strip (NGP) and the South 
Galactic Strip (SGP) constructed from the 2dFGRS full data release. We find that 
the results obtained from the NGP and the SGP are statistically compatible. From the 
results of several statistical tests we conclude that voids are essentially uncorrelated, 
with at most a mild anticorrelation and that at the 99.5% confidence level there is a 
dependence of the void number density on redshift. We develop a technique to correct 
the distortion caused by the fact that we use the redshift as the radial coordinate. 
We calibrate this technique with mock catalogues and find that the correction might 
be of some relevance to carry out accurate inferences from void statistics. We study 
the statistics of the galaxies inside nine nearby voids. We find that galaxies in voids 
are not randomly distributed: they form structures like filaments. We also obtain the 
galaxy number density profile in voids. This profile follow a similar but steeper trend 
to that follow by halos in voids. 

Key words: large-scale structure of Universe; cosmology: observations; methods: 
statistical 



1 INTRODUCTION 

There are many tests to constrain the models of structure 
formation, which rang e from the large-scale statistics such as 
correlation functions JP eeble^|[l980|: [D avis fc Peebles! Il983t 
IZehavi e^alj 2002: Norberg_et_alJ|2002j) o r the power spec - 
trum JPeebledll98ttlTegmark et aUl2004 ICole et al.ll20o3) . 
to detailed studies of the physical properties of individ- 
ual galaxy clusters and voids. Although dense environments 
(i.e., galaxy clusters and groups) have been extensively stud- 
ied, the underdense regions like giant voids attract less at- 
tention. Yet, they are not less important a nd can provide im- 
portant information on galaxy formatio n ()HoYie_^t_ajJ|2005|; 
iGoldberg et"afll200Et iRoias et alJl2005t ICroton et al-lhooa) 
and can give independent constraints on cosmological mod- 



* E-mail: spatiri@iac.es 



els (iPeebles Il200ll: iPlionis fc Basilakos IbOOg ICroton et al. 
[20041 : IColberg et alJl2005l : ISolevi et alJ 120051: IConrov et al. 

2005). Large voids have woken up more and more interest 
since their first detections 25 years ago (Kirshner et al. 1981, 
Rood 1981). At present, thanks to the advent of la rger red- 
shift s urveys like the 2d FRGS JColless et al.ll200ll) and the 
SDSS (lYork et al.l l200oh. higher resolution of cosmological 
simulations and better analytical frameworks, we can ex- 
tract accurate statistical information about voids. This in- 
formation can be used in different ways but one of the most 
important is to test the models of structure formation. 

Voids can be studied in different ways. One of t he clas- 
sical methods i s the Void Probability Function (VPF,[Whitc 
ll979tlfVvll986l) . which gives the probability that a randomly 
located sphere of a given radius contains no galaxies (see e.g. 
Einasto et al 1991, Croton et al 2004, Solevi et al 2005). The 
number density of voids with radius greater than R is an- 
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other useful statistic. This number density and also the void 
significance can be estimated analytically (see Patiri et al. 
2004). So far, it has bee n done only using nu merical simu- 
lations or mock catalogs dColberg et al]l2005ft . 

Voids in the 2dFGRS wer e previously studie d 
bv iHovle fc Voeelevl <2004) and ICroton et all <2004V 
ICroton et alJ l|2004h have measured the VPF for volume- 
limited galaxy samples covering the absolute magnitude 
range Mb 3 — 51og 10 h — —18 to —22. Their work mainly fo- 
cused on the study of the dependence of the VPF on the mo- 
ments of galaxy clustering as a test to discriminate among 
different clustering models. They found that the VPF mea- 
sured from the 2dFGRS is in excellent agreement with the 
paradigm of hierarchical scaling of the galaxy clustering. 
In addition, they showed that the negative binomial model 
gives a good approximation of th e 2dF data over a wide 
range of scales. On the other hand. lHovle fc Vogelevl <l2004l) 



have also calculated the VPF in the 2dFGRS obtaining sim- 
ilar results. They have obtained the VPF for the dark mat- 
ter matter halos in ACDM simulations and galaxy mock 
catalogs from semi-analytic models of galaxy formation to 
compare with the data. They have found that the results 
from the semi-analytic models that include feedback effects 
provide a VPF that agree with the VPF measured for the 
2dFGRS and differ from that measured from the dark mat- 
ter distribution. 

In spite of the fact that the notion of voids is not 
new, there is no standard definition of what is a void. 
"Voids" sometimes mean quite different objects. It all de- 
pends on used data and goals of the analysis. For exam- 
ple, to expla in the patterns of the galaxy distributi on in 
the Universe IVan de We veaert fc Van Kampenl (1 19931) and 
ISheth fc Van de \^^aert] <|2004T ) define vo ids as irregu- 
lar low -density regions in the density field. IColberg et alJ 
i2005f) use a similar definition to study void properties in 
a ACDM universe. However, as these definitions are not 
based on point distributions, it may be difficult to deal 
with th e galaxy samples provided by large scale redshift 



iranl ( 1997V H ovle fc Voeelevl (1200 



surv eys. lEl-Ad H. fc . 

and IHovle fc Voeelevl J2004ft define voids as irregular re- 
gions of low number-density of galaxies, which may con- 
tain bright galaxies. Thus, by construction, voids are not 
empty even of v ery luminous an d likel y massive galax- 
ies. By contrast, iGottlober et al] i2003T) define voids as 
spherical regions which do not have massive objects (ha- 
los in this case). Voids also can be defined in a statis- 
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In this paper we define voids as the maximal non- 
overlapping spheres empty of objects with mass (or lumi- 
nosity) above a given value . For example, we could define 
voids as maximal spheres empty of Milky Way-size galaxies. 
While voids are empty of these galaxies, they could have 
fainter galaxies inside. See Figure 1 for a graphical repre- 
sentation of our definition. 

The first step to follow using voids as a test for large 
scale structure and galaxy formation models is to develop 
a robust algorithm to detect them, to calibrate it, and to 
obtain the statistical properties of voids for different cat- 
alogs both real and simulated ones. With this information 
and with the predictions made through the analytical for- 



malism we may be able to contrast different structure for- 
mation models. To achieve this goal we develop two algo- 
rithms. They are conceptually different but are based on 
the same definition of void. We develop them as comple- 
mentary tools. One algorithm is intended to search for all 
the voids in galaxy or dark matter halo samples and the 
other is developed to search for the rarest voids. 

Once we have the tools to detect the voids, we study 
statistical properties that will be used to test the structure 
formation models. These properties go from the void corre- 
lations to the redshift dependence of voids. In the present 
work, we apply the tools we mentioned above in order to 
study the statistic of voids in the 2dF Galaxy redshift Sur- 
vey. One important point that could provide clues on the 
galaxy formation processes is the galaxy contents of voids. 
In this work, we present the first results on the distribution 
of faint galaxies in nearby voids. 

In section 2 we briefly describe how to detect voids. In 
section 3 we present the statistics of voids found in cosmo- 
logical numerical simulations and compare the performance 
of our algorithms. In section 4 we present the voids that 
we have found in the 2dFRGS together with their statistical 
properties. In section 5 we develop a method to get from the 
redshift space coordinates, the real space ones. In section 6 
we present the statistics of the galaxies inside nearby rare 
voids, finally, in section 7 conclusions and discussions are 
presented. In Appendix A we give details of our void finders 
presented in section 2. Here we also provide tests in order 
to check the performance of them. 

Throughout this work we adopt a ACDM cosmology 
model with parameters Q m = 0.3 and Qa = 0.7. 



2 VOID DETECTION ALGORITHMS 

Once we have defined what is a void (as the maximal non- 
ovelapping spheres empty of galaxies brighter than a given 
magnitude), the next step is to develop an algorithm to be 
able to find them in galaxy or dark matter halo samples. 
The computation structure of the algorithm naturally will 
depend mainly on the void definition. Following the defini- 
tion we have stated in the previous section, the algorithm 
should try to find the maximal sphere that the void can 
accommodate. 

There are many algorithms in the literature inspired by 
the different void definitions (see e.g. Einasto et al 1989, 
Kauffman & Fairall 1991, El Ad & Piran 1997, Aikio & 
Mahoenen 1998, Gottlober et al. 2003, Colberg et al 2004). 
Aikio & Mahoenen 1998 and Colberg el al 2004 have de- 
veloped similar algorithms. Following their definitions of 
voids, they generate a field smoothing the point distribu- 
tion on a cubic grid and find the local minima; then, they 
put spheres around those minima filling the entire under- 
dense region. The radius of these voids are the effective 
ones taken from the sphere that contain the same volume 
as the void. Gottlober et al. 2003 have developed an algo- 
rithm based on the minimal spanning tree for halos selected 
by intrinsic properties, in this case with mass above a given 
value. Although they have searched for voids as local maxi- 
mal spheres, they did not do voids statistics in their work. 

Another algorithm in literature is the so-called void 
finder developed by El Ad & Piran (1997) and its modifica- 



© 0000 RAS, MNRAS 000, 000-000 




Figure 1. A graphic representation of our void definition. Voids 
are maximal non-overlapping spheres, in the observational do- 
main, which are empty of objects classified by some intrinsic prop- 
erty. For example, in galaxy samples our voids will be defined by 
galaxies more luminous than a given luminosity L (filled circles). 
Open circles denote galaxies with luminosity fainter than L. Note 
that for simplicity we do not show the open circles located outside 
the maximal sphere. 



tions done by Hoyle & Vogeley (2002) . This algorithm search 
for arbitrarily shaped regions delimited by the so-called wall 
galaxies in order to get the maximum volume of the void. 
They classify the galaxy sample in wall galaxies and field 
or voids galaxies by mean of a criteria that depends on the 
galaxy distribution itself, i.e. they define a length parame- 
ter l n such that any galaxy that does not have n neighbours 
within a sphere of radius l n is classify as field galaxy. Note 
that field galaxies could be, for example, bright galaxies. 
These field galaxies are removed before searching for voids. 
Once the classification is done, the algorithm search for the 
maximal spheres defined by the wall galaxies based on a cu- 
bic mesh. Once all the spheres are obtained, they define an 
overlapping parameter to discriminate if two spheres belong 
to the same void. Similarly to Aikio & Mahoenen (1998) and 
Colberg el al (2004), they finally fill the voids with spheres 
to get the maximum volume. Again, they define the effective 
radius of a void by means of a sphere that contain the same 
volume as the void. 

Note that, although this may be interesting for some 
studies (e.g. the shapes of voids), these kind of statistics de- 
grade in some amount the information available in the ac- 
tual galaxy distribution, so they might not be particularly 
powerful to conduct accurate statistical inferences. This is 
similar to what happens with the binned data: the best sta- 
tistical test using binned data is never better and usually 
worst than the best test using the raw data. 

To identify voids, we designed two algorithms that are 
conceptually different but both are based in our definition 
of voids. The algorithms are complementary. They help us 
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Table 1. Parameters of simulations 



Box Mass resolution Number of particles erg 

(h- 1 Mpc) (ft -1 M ) 

80 3.18 x 10 s 512 3 0.90 

120 1.07 x 10 9 512 3 0.90 

500 7.80 x 10 10 512 3 0.90 

to investigate the variety of aspects present in the statistics 
of voids. 

One of the algorithms, which we call CELLS void finder, 
was designed to search for all the voids in a galaxy or halo 
sample based on a computational grid. This grid define the 
working resolution. To determine the void centers, the code 
computes the distances between each of the empty grid cells 
and all the galaxies or halos in the whole observational do- 
main, keeping the minimum distance. Once we have the list 
with the minimum distances, we search for the local maxima 
which corresponds to the void centers. Obviously, the voids 
radius are those maximum distances. 

The other algorithm, which we denominate HB Void 
Finder is conceptually simple. This code searches for the 
maximal non-overlapping spheres with radius larger than a 
given value. First of all, we generate over the sample a large 
sample of random spheres of a given radius. After this, we 
check and keep the spheres that are empty of galaxies. We 
inflate these spheres until they reach the maximum radius. 
Finally, we eliminate the overlapping spheres keeping the 
maximal ones. This code is very accurate and computation- 
ally efficient to search the biggest voids in a galaxy sample. 
Detailed descriptions of the two algorithms are given in the 
Appendix. 



3 VOID STATISTICS IN SIMULATIONS 

3.1 Numerical simulations 

We perform a series of numerical simulations with the Adap- 
tive Refinement Tree code (ART, Kravtsov et al. 1997) and 
the Tree-SPH code GADGET (Springel, Yoshida & White, 
2001). Dark matter halos are identified in the simulation 
by the Bound Density Maxima algorithm (BDM, Klypin & 
Holtzman 1997; Klypin et al. 1999). 

In this work we detect and study voids in simulation 
boxes of 80/i _1 Mpc, 120/i _1 Mpc and 500/i -1 Mpc size. The 
parameters of all the simulations are summarized in Table 
1. With these boxes we have enough volume to study accu- 
rately the void statistics and compare the results obtained 
with our two void finders. 

3.2 The statistics 

In Figure [5] we show the number density of voids as a func- 
tion of their radius obtained with our void finders (squares 
for CELLS void finder and circles for the HB void finder). 
These results were obtained for voids defined by two halo 
masses (1 x 10 12 M Q (open symbols) and 5 x 1O 12 M (filled 
symbols)). 

In order to search for voids with the HB Void Finder, we 
have generated 1 x 10 7 , 2 x 10 7 , 5 x 10 7 trial spheres for the 
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Figure 2. Number density of voids in numerical simulations ob- 
tained with our two void finders (squares for the CELLS void 
finder and circles for the HB void finder) . These results were ob- 
tained using our 80/i _1 Mpc and 120/i _1 Mpc boxes. The open 
circles and squares denotes the number density of voids defined 
by halos with masses larger than 1 X 10 12 M Q while the filled 
circles and squares are the number density of voids defined by 
halos with masses larger than 5 X 10 12 M„ . Here, we can see that 
both algorithms get the same results as voids are bigger (symbols 
are overlapped). However, as we go to more common voids the 
differences between codes are bigger. This is due to the fact that 
the HB void finder is not so efficient detecting common voids (see 
text for details). 



80/i _1 Mpc, 120/i _1 Mpc and 500/i _1 Mpc boxes respectively. 
In the case of the Cell Void Finder we defined a working 
resolution of 0.5/i _1 Mpc for the 80/i _1 Mpc and 120/i _1 Mpc 
boxes and 1.2/i _1 Mpc for the 500/i _1 Mpc box. 

The obtained statistics shows that both algorithms 
gives very similar results. These results are indeed very use- 
ful to learn about the performance of the void finders. Both 
algorithms give exactly the same results for the largest voids 
(these are rare voids), being the HB Void Finder the fastest 
and more precise. However, as we go to more common voids 
the Cell Void Finder has the best performance. The small 
differences between both codes for smaller voids are due to 
the fact that we need more realizations of the trial spheres 
in the HB Void Finder. As it is expected, the voids are more 
numerous and larger the more massive the halos defining the 
void are. 



2dFRGS is taken from the APM galaxy catalogue (Mad- 
dox et al. 1990). The spectroscopic targets are galaxies with 
extinction corrected magnitudes brighter than bj = 19.45. 
The median depth of the survey is z ~ 0.11. The final 
data releases contains a total of 221,414 high quality red- 
shifts. There are two large contiguous survey regions, one 
in the south Galactic pole (SGP) and another one towards 
the north Galactic pole (NGP). There are also a number 
of random fields which we have eliminated from our void 
search. Full details of the 2dFGRS can be found in Colless 
et al.(2001, 2003). 

In order to search for the voids we have selected from 
2dFGRS two rectangular regions: the region in the SGP de- 
fined by -34° 40' < S < -25° 12' and 21 h 49' < a < 3 h 26' 
and the region in the NGP defined by -4°35' < S < 2° 17' 
and 9 h 33' < a < 14 h 54'; that is, ~ 690 and ~ 550 sq. de- 
grees respectively. 

The 2dFGRS is magnitude-limited, i.e. the survey has 
been constructed by taking spectra of galaxies brighter than 
a fixed apparent magnitude of fej = 19.45. However the sur- 
vey is homogeneously complete up to 90% at fej = 19.0 (see 
Norberg et al. 2002). A magnitude- limited galaxy survey is 
not uniform in space, since intrinsically faint galaxies have 
been observed only if they are relatively nearby, while at 
large distances only bright galaxies will be targeted. This 
non uniformity of the magnitude-limited survey must be 
taken into account in order to make our void analysis. There 
are mainly two ways to deal with this; one is to use the selec- 
tion functions provided by the 2dFGRS and another more 
simple way which we have followed here is to build volume- 
limited samples. 

We construct four volume-limited samples, two for 
each survey region (SGP and NGP), one with depth 
Dmax = 406.15/i~ 1 Mpc which corresponds to z = 0.14. 
For this D max we have a limiting absolute magnitude 

M lim = _ 19 32 + 51ogfe and the Qther w j th depth = 

571.71ft Mpc corresponding to z = 0.2 and a limiting ab- 
solute mag nitude M l b \ m = -20.181 + 51ogft. All distances 
are comoving ones. In Table 2 we give the properties of the 
volume-limited samples. Now, we have guaranteed that any 
galaxy brighter than M^™ 1 is observed in our volume. We 
have computed the absolute magnitudes from the apparent 
magnitudes assuming a ACDM cosmology and applying the 
needed corrections to model the change in the galaxy magni- 
tudes due to the redshifted &j filter bandpass (fc-correction) 
and to account for the galaxy evolution (e-correction). These 
corrections for each galaxy are given in Norberg et al.(2002). 

To test the spatial homogeneity of our SGP1,2 and 
NGP 1,2 volume-limited samples we have computed the av- 
erage of the cube of the radial distances, i.e. a modified 
version of the V m ax test (Rowan- Robinson 1968), which for 
a homogeneous sample must satisfy: 



4 VOID STATISTICS IN THE 2dF GALAXY 
REDSHIFT SURVEY 

4.1 The galaxy samples 

In the present work we use the 2dFGRS final data release 
(Colless et al. 2003) to obtain the void statistics in large 
galaxy redshift samples. The source galaxy catalogue of the 



((r^) 3 )4 



y r ma x' ' 2 y/\2N' 

A' = 



(1) 



{1 + N <Z{r) >) 



where r is the comoving distance to each galaxy, r ma x 
is the same as D max , the maximum distance of the sample, 
N is the number of galaxies (given in table 2) and < £(r) > 
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Table 2. Parameters of our Volume Limited Samples. 



Name 






Dmax 


Volume 


■*Y gxs 




M} >3 — 51og h 




/i^Mpc 


10 6 h- 3 Mpc 3 




SGP1 


-19.32 


0.14 


406.15 


4.693 


22037 


SGP2 


-20.181 


0.20 


571.71 


13.088 


14475 


NGP1 


-19.32 


0.14 


406.15 


3.749 


19695 


NGP2 


-20.181 


0.20 


571.71 


10.456 


11404 



is the average value of the correlation function over all pair 
of galaxy positions within the sample, defined as: 



<£>= 



1 

V* 



£(|ri - r 2 |) dridr 2 



(2) 



where V is the volume of the sample. Assuming for £(r) 
(Peebles 1980): 



(3) 



-5Ah~ 1 Mpc J 

we found < f(r) >= 6.7910 x 10~ 3 . It must be noted that 
we have used eq.@ for any value of r, while in fact, it is 
known that for r >> 10/i~ 1 Mpc £(r) must be close to the 
fourier transformed of the linear power spectra. This would 
lead to smaller values of < {(r) >. However, this would not 
affect our analysis too much. With this value we find that 
for a homogeneous sample: 



<(y-) 3 K ± - 024 - 

' max 



(4) 



The actual 77 values for the SGP1 and NGP1 are r){SGPl) = 
0.524 and f](NGPl) = 0.51. For the SGP2 and NGP2 to be 
homogeneous (assuming that < £ (r) > takes the same values 
as in the previous sample): 



^<(^)"> = 5 ± 0.0253, 



(•») 



and the actual values are tj(SGP2) = 0.511 and r/(NGP2) = 
0.475. Therefore, we can conclude that we do not detect 
inhomogeneities . 




-500 -400 -300 -200 -100 
x [Mpc/h] 




-500 -400 -300 -200 -100 
x [Mpc/h] 



4.2 The void statistics 

As described in section 3.2, once we have constructed our 
samples, we have to define which objects will define the 
voids. In the SGI and NGP1 volume limited samples we 
present in table 2 we search for voids defined by two types 
of galaxies. In the one hand, we search voids defined by 
galaxies brighter than M&j = —19.32 + 51og7i. Using these 
samples we also find voids defined by galaxies brighter than 
Mjj = — 20.5 + 51og/i. The number of galaxies brighter than 
M bj = -20.5 + 51og/i is 2427 for the SGP1 sample and 
2074 for the NGP1. In the SGP2 and NGP2 samples, we 
have searched for voids defined by galaxies brighter than 
M},, = —20.181 + 51ogft. In Figure [3] we show a plot with 
the voids we have detected in the NGP1 sample. In table 3 
we summarize the results for the NGP1 and SGP1 samples. 
Note that, as we saw in the simulations, voids defined by 
bright galaxies are larger than the ones defined by fainter 
ones. In table 4 we present the statistics for the NGP2 and 
SGP2 samples. We have obtained these voids applying the 
HB algorithm. In the radius ranges that we present here both 



Figure 3. Voids in the 2dFGRS. We show the maximal spheres in 
the NGP1 volume-limited sample of galaxies (rilled circles). The 
upper panel shows the voids with radius larger than 7.5/i _1 Mpc 
defined by galaxies brighter than M bj = —20.5 — 51og h. There are 
2074 galaxies in this sample. In the lower panel we show the voids 
with radius larger than 13.0/i _1 Mpc defined by galaxies brighter 
than M b = —19.32 — 51og h. The number of galaxies is 19695. 
Some galaxies lie inside the maximal spheres due to projection 
effects. 



codes have a similar performance. We generate 8 x 10 7 trial 
spheres to search for voids larger than 7.5/i _1 Mpc which are 
our main interest. 

We have also calculated the VPF which is the probabil- 
ity that a randomly located sphere of fixed radius contains 
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Table 3. Statistics of Voids in our NGP1 and SGP1 samples. 
Nngpi an d NsGPl a re the number of voids larger than the given 
radius. We present the void statistics for two different defining 
galaxies. 



Radius 
(Ti-ijVlpc) 


Nngpi 


NsGPl 


M Um _ _ 19 32 _ 5\ og h 


7.5 
10.0 
12.0 
13.0 
14.0 


136 
48 
11 

6 

1 


220 
80 
25 
11 
6 


Al! im = -20.5 


— 51ogfo 




13.0 
14.0 
16.0 
17.0 
19.0 


28 
24 
14 
7 
2 


43 
35 
26 
17 
6 



Table 4. Statistics of Voids in our NGP2 and SGP2 samples. 



Radius N n qp2 A r SGP2 
(/i-ijVlpc) 



M Um = _ 20. 181 - 51ogh 



13.0 


68 


101 


15.0 


36 


53 


16.0 


25 


31 


17.0 


9 


23 


18.0 


5 


11 


21.0 




2 



So that, taking into account the correlations in the sample, 
the rms of the VPF is given by: 

rms(P (r)) ~ g(r)^ 2 (1 _ W n v (R))^ 2 (10) 

where wn v (R) is as defined in Eq. l2H , I n the rare voids 
limit , i.e. when Pa <<< 1 it is shown that feetancort-Riio I 
Il992ft : 

g(r) ~ 9.20 (11) 

For voids with radius larger than 12/i _1 Mpc in the 
SGP1 and NGP1 samples and larger than 17/i _1 Mpc in the 
SGP2 and NGP2 samples, the asimptotic value g(r) 1,/2 — 
2.86 is a good approximation. For smaller values of r, g{r) x ^ 2 
is somewhat smaller (no less than ~ 2). So in figure Q, 
where the asimptotic value of g(r) have been used, the er- 
ror may be slightly overstimated for small values of r. It 
must be noted, however, that the probability distribution of 
the fluctutations of the VPF around the mean is strongly 
non-Gaussian for small values of N(r); most fluctuation be- 
ing quite smaller than the rms and a few of them being 
very large. So, to decide the compatibility between couples 
of measurements, this fact has to be taken into account. In 
the present case, however, this problem do not arize because 
all results are within the error bars) 

Our results are in good agreement with the previous 
calculations of the VPF (Hoyle & Vogeley 2004, Croton et al. 
2004). In Croton et al.(2004) the VPF was obtained for the 
whole 2dFGRS while Hoyle & Vogeley (2004) have studied 
the VPF for both strips (NGP and SGP). In this last result, 
they found that the VPF for both strips are not compatible 
for large spheres (see figure 7 in Hoyle & Vogeley 2004). 
This is be due to the fact that they underestimated the 
error bars. We have shown above that using a more accurate 
expression for the rms of the VPF we obtain that both strips 
are compatible. 



no galaxies. Figure 0] presents our VPF, which we find for 
the SGP1,2 and NGP1,2 samples. We calculate the rms of 
the VPF in the following way. Assuming that voids are in- 
dependent, we get from the central limit theorem that 



\Po(r)) = 



N(r) 



'< Av > - < Av > z 



(6) 



where Av is the volume where the center of an empty sphere 
of radius r may be moved so that it remains empty. V is the 
volume of the sample and N(r) is the number of voids found 
with radius larger than r in that volume. The parenthesis in 
the right hand side of eq.Q divided by V 2 is the contribu- 
tion of each individual void to the v ariance of the VPF. 
On the other hand we have from lBetancort-Riio I (Il992f) 

that 

N(r) Mr) (7) 
V <Av>' (> 

Now we square this expression and substitute it in expression 
@ to obtain: 



rms 2 (Po(r)) = 



PS(r) 



N(r) 



(Sir) - 1) 



9(r) 



< Av 2 > 

< Av > 2 



(8) 
(9) 



4.3 Voids spatial distribution 

If we neglect the redshift dependence of the power spectra, 
the spatial distribution of the voids found in a statistically 
homogeneous sample must be statistically homogeneous it- 
self, but conditioned to the fact that the maximal sphere 
must lay within the sample. This fact implies that the cen- 
ter of the maximal spheres can not occupy all the volume 
of the sample. The available volume, V(R), for the maximal 
spheres of radius R is: 



V(R) 



R pS +AS pa +Act-B(r,R,S 



r 2 cos 5da dfyl&i) 



JS J ao +B(r,R,S) 



B(r, R, 5) = sin" 



xf ^-\f) 
cos<5 



Eg. 1121 may be considerably simplified by carrying out the 
integrals over a and 5 keeping B fixed (using its value at 
So + AS/2). This is a very good approximation since AS « 
1. So, we could write: 



V(R) 



P(r, R)r 2 dr 



(13) 
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Figure 4. Void Probability Function for the 2dFGRS. We show 
the VPF for the SGP1,2 samples (open and filled circles respec- 
tively and also for the NGP1,2 samples (filled and open triangles 
respectively) . 



P(r,R) = [sin(<5 + AS - sin - ^-)) - sin(5 )] x 



[Aa - 2 sin 1 ( 



sm (-) 

AS 

2 



cos(<5 ■ 



;)] 



(14) 



A8,Aa are, respectively, the widths of the strips in declina- 
tion and right ascension, while <5o is its southernmost decli- 
nation. Although expression 1141 for V(R) is not exact, its 
percentual error is completely negligible (less than 10~ 5 ). 

Using P(r) we may obtain the mean of the cube of the 
distance (from the observer), r, to the center of the maximal 
sphere with radius larger than R': 



P(r,R)(-^— fr 2 dr 



V(R) 



where R is the mean radius of all maximal spheres larger 
than R'. 

In a similar manner we may obtain the variance for 
the estimate of r/(R/h~ 1 Mpc) from the void list we have 
obtained in the 2dFGRS. The values found in the SGP1,2 
and NGP1,2 for voids defined by galaxies brighter than 
— 19.32 ± 51og/i are listed in table 5). 

From the analysis of these results we may conclude that 
the voids are essentially compatible with homogeneity. There 
is, however, a slight, although statistically significant, trend 
to values lower than homogeneous that we will discuss in 
subsection (3.5). 

Now, we will consider the nearest neighbour statistics 
which together with the void one point statistic we have 
studied above is sufficient to grant the validity of the statis- 
tical analysis in the following subsections. 



Radius 


V 


V 


(h-iMpc) 


NGP1 


SGP1 


7.5 


0.512 (0.563 ±0.02) 


0.503 (0.53 ± 0.02) 


10.0 


0.579 (0.579 ± 0.038) 


0.508 (0.54 ±0.03) 


13.0 


0.66(0.615 ± 0.116) 


0.433 (0.558 ± 0.08) 




NGP2 


SGP2 


18.0 


0.475 (0.611 ±0.085) 


0.588 (0.556 ± 0.06) 



Table 5. The values of r)(R') found in the SGP1,2 and NGP1,2 
samples. The values within the parenthesis are la predictions as- 
suming that the center of the maximal spheres are uniformly dis- 
tributed conditioned to lay entirely within the sample (see text). 



The maximal spheres are chosen so that they do not 
overlap, in order that they very rarely corresponds to the 
same connected underdensity. So, by construction, for the 
maximal spheres with radius larger than R the two point 
correlation function of their centers is —1 at least to a dis- 
tance of 2R. The main question is how is the correlation at 
larger distances. Note that if more than one maximal sphere 
were associated with the same underlying connected under- 
density the correlation would be positive for distances some- 
what larger than 2R. However, as there is typically only one 
maximal sphere per underdense region and in the standard 
scenario of structure formation these regions are essentially 
uncorrelated (for the relevant R values) (Colberg et al. 2004), 
we do not expect to find correlations between the maximal 
spheres. 

To test whether voids are correlated we compare the 
actual nearest neighbour statistics with the theoretical pre- 
dictions corresponding to centers which are uncorrelated for 
distances larger than 2R and completely anticorrelated for 
smaller distances. For maximal spheres with radius R the 
theoretical predictions for the mean and quadratic mean 
distance to the nearest neighbour (< D ao > and < D 2 a > 
respectively) is given by: 



< D 00 >= 



< Dt„ > 



(Co 



Ann v (R) 



-1/3 



-O-iu ) ^1/3 



dw 



(16) 



^ 4^(fl ) x 2/3 
^-(2R) 3 n v (R) 



- (w - wo) w 2/3 dw(17) 



where fi v (R) is the mean number density of the maximal 
spheres larger than R and R is the mean radius. The sam- 
pling error of < D ao > when estimated using N centers is: 

with these expressions we may compute the mean and the 
variance of q: 



R 



h~ 1 Mpc 



= {n v {R)) 1/3 E{< D oa >) 



(19) 



where E(< D oa >) is the estimate of < D oa > using the 
void sample. For typical voids we find for q: 
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<7sgpi(7.5) 
<7atgpi(7.5) 

<?SGP2(13.0) 
<7iVGP2(13.0) 



0.978 (0.958 ± 0.006) 
1.029 (0.958 ± 0.008) 
0.966 (0.879 ± 0.01) 
1.1 (0.879 ±0.013) 



where R = 7.5/i~ 1 Mpc (R = g.SSft^Mpc) correspond to 
SGP1, NGP1 and R = 13.0/i _1 Mpc (R = 15.08/i _1 Mpc) to 
SGP2, NGP2. The values in the parenthesis correspond to 
la predictions obtained using Eqns. ll7t and 1181 . 

It is apparent that the estimated values are slightly 
shifted upwards. The bias is large in terms of the rms's 
but we can not conclude from this that the centers are an- 
ticorrelated since the predictions do not account for border 
effect, which in this cases is small for the SGP samples but 
larger than the rms. 

for rare voids, i.e. R = 
NGP1 samples (Rsgpi = 
13.42^* 1 Mpc) and R = 
NGP2 samples (Rsgp2 = 
Mpc) we find: 



On the other hand, 
13.0/i _1 Mpc for the SGP1, 
14.14/i _1 Mpc,^]vGPi 
18.0/i _1 Mpc for the SGP2, 
19.5/i- 1 Mpc,i?ivGP2 = 18.45/1" 



<?sgpi(13.0) 
gjvGPi(13.0) 

<?SGP2(18.0) 
l?]VGP2(18.0) 



0.959 (0.716 ± 0.004) 
1.398 (0.716 ± 0.006) 
0.971 (0.724 ± 0.046) 
1.071 (0.724 ± 0.070) 



The upward bias with respect to the values for the uncorre- 
cted centers (without accounting for border effects) is now 
large even in the SGP. To asses the border effect we have 
simulated a random set of spheres with R = 19.5/i -1 Mpc 
(the mean radius of the maximal spheres with radius ^ 
18.5/i _1 Mpc in the SGP2 sample) with the condition that 
they lay entirely within the catalogue and do not overlap, 
then we obtain the nearest neighbour statistics. The average 
over several realizations of the mean nearest distance is: 



n„(18.0) < D 00 >= 0.88 ± (0.05 ± e) 



(20) 



which is quite larger than the value predicted without bor- 
der effect (0.724). This is not enough, however, to account 
for the measured value (0.971). The same result is found 
for Zmax = 0.14 since the border effect in this case must be 
very close to that in the previous case. From these results we 
might conclude at least at the 99% confidence level (being 
rather conservative about the value of e) that the center of 
the maximal spheres show some anticorrelation. We prefer, 
however, to conclude simply that those centers are basically 
uncorrelated, which is the necessary result for the subse- 
quent statistical inferences, and leave open the question of 
the possible existence of a weak anticorrelation, which shall 
be treated in more detail in a future work. Incidentally, it 
must be noted that a slight anticorrelation of the centers 
arise naturally in the standard scenario, being compatible 
with the results of Colberg et al. 2004. The reason is that 
the conexed underdense regions are somewhat larger than 
the maximal spheres they contain. 



4.4 Compatibility between SGP and NGP 
samples 

Here we compare the void statistics found in our samples 
in order to asses their compatibility. To this end we must 
estimate the mean number density of voids, n v (R), in each 
sample and obtain their sampling error. To estimate n v (R) 
we simply divide the number of voids, N(R), with radius 
larger than R, by the mean available volume V(R). To es- 
timate the sampling error we must take into account that, 
although the center of the maximal spheres are basically 
uncorrelated for r > 2R, they are totally anticorrelated for 
r < 2R (we use a model in which all spheres have radius 
R). In this case the number of spheres in a given volume 
do not follow a Poissonian distribution, but a negative bi- 
nomial one (Betancort-Rijo 1999). For this distribution the 
sampling error of the estimate of n v (R) could be obtained 
by means of the following expression: 

n v {R) 



rms(n v (R)) = 



(N(R)) 1 / 2 



(1 



>(R)) 



1/2 



(21) 



w^^R 3 [ l + 2.73n v (R)^R 3 



4 V 2R 
For SGP1 we find: 

n„(7.5) = 9.364 ± 0.164 x 10 
n„(10.0) = 3.98 ± 0.11 x 10" 5 
ft„ (13.0) = 7.26 ± 1.70 x 10" 6 
For NGP1 we find: 



_ 3 n v {R)-y 3 1 n v (R)-y\ 
A * 5 ' 8 l 2R ' 



9.597 ±0.214 x 10" 
4.27 ±0.15 x 10~ 5 
6.56 ±2.10 x 10~ 6 



ra„(7.5) 
ft* (10.0) 
n„(13.0) 

The corresponding R's are: £(7.5) = 9.58 and £(10.0) = 
11.35 (the mean over NGP and SGP), £(13.0) = 14.14 in 
the SGP1 and £(13.0) = 13.41 in the NGP1. 

It is apparent that the three pair of values are compati- 
ble. It is true that for the SGP1 the galaxy density is roughly 
a 10% smaller than for the NGP1 and that the th eoret- 
ical expression jPatiri. Betancort-Riio fc Pradal 2004) pre- 
dict about a 20% enhancement of the density of voids with 
R ^ 13h _1 Mpc in the SGP1 over that in the NGP1, and 
smaller differences for more common voids, but this differ- 
ences do not show up above the sampling errors. 

Similar results were found for the SGP2: 



ra„(13.0) = 

ra„(15.0) = 

ft* (18.0) = 
and NGP2, 

ra„(13.0) = 

n„(15.0) = 

n„(18.0) = 



1.685 x 10~ J 
9.49 x 10~ 6 
2.978 ±0.58 x 10" 

2 x 10 -5 
1.202 x 10~ 5 
2.22 ±0.69 x 10~ ( 



Although the implications of this results for the large 
scale structure formation models is left to for a future work, 
it is interesting to note that they are generally in good agree- 
ment with the predictions of the standard model. 
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4.5 Redshift dependence of void number densities 

We have checked that the galaxies in our samples are, within 
the sampling errors, homogeneously distributed over the 
sample volumes. In this situation, the only possible expla- 
nation of an statistically significant departure of the void 
distribution from the homogeneity must be the redshift de- 
pendence of the properties of the large scale structure. 

Using a theo retica l framework 

iPatiri. Betancort-Riio fc Pradal |^004) to compute the 
number density of voids in the distribution of dark matter 
halos, we found that the predicted number density of 
voids larger than 13.0/i _1 Mpc for M^ m = -19.32 + 51og/i 
decreases by 28% from z=0 to z—0.1 and a further 28% 
when going to z=0.2. For M^ m = -20.181 + 51og/i and 
R > 18/i _1 Mpc the void density decreases by 26% from z=0 
to z=0.1 and another 28% from z=0.1 to z=0.2. Where, 
in order to link light with dark matter halos, we assumed 
that there is one galaxy in each dark halo. These numbers 
suggest the possibility of a measurable effect. To this end 
we conduct first a test where we divide every sample into 
two bins (a 'near' region and a 'far' region) both with the 
same volume available for the voids considered. Then we 
compare the total number of voids larger than the chosen 
radius in the near regions with the total number in the far 
regions. If the first number is significantly larger than the 
latter this should be interpreted as evidence for a redshift 
dependence. 

The problem now is that the effect we are trying to 
measure is strong only for the rare voids which have poorer 
statistics. We reach a compromise between both trends. We 
have chosen voids with radius (R m i n ) larger than 13/i _1 Mpc 
for the Zma X = 0.14 sample and 18/i _1 Mpc for the z m ax = 
0.2 sample. In order to divide a sample into two subsamples 
with the same volume available for the centers of the voids, 
we first obtain the mean radius (R) of the voids larger than 
Rmin, and, using Eq. H2H with R = R, we obtain the availabe 
volume of the whole sample (which has already been done 
in section 4.3). Then, we use again 111 2t with R — R but re- 
placing r ma x with rii m + R and search for the value of ru m 
giving for V half the value corresponding to the whole sam- 
ple. If voids were uniformly distributed their centers would 
lie above and bellow ru m with equal probability. We have 
explicitly checked this fact with numerical simulations. For 
the sample SGP1, we have chosen R m in = 13/i _1 Mpc, St..) 
R = 14.14/i _1 Mpc which imply n im = 371.7/i _1 Mpc. 9 
voids were found with their centers bellow this distance and 
2 above. Proceeding in a similar way with the other sam- 
ples, we found 3 voids closer than 349.1/i _1 Mpc and 3 voids 
further for the NGP1. For the SGP2 we found 5 voids closer 
than 475.0/t _1 Mpc and 8 further, while for the NGP2 we 
found 4 voids closer than 490.2fe _1 Mpc and 1 further. So, 
in total we have 21 voids in near regions and 14 in the far 
regions. 

With these numbers we can infer at least at the 88% 
confidence level the presence of a redshift dependence. How- 
ever, this is only a marginal evidence. To make more patent 
this evidence we use a more efficient test. This test uses the r\ 
values obtained in 4.3 which are shown in table 5. We use the 
r\ values correpsonding to voids larger than 13/i _1 Mpc in the 
NGP1 and SGP1 samples, and voids larger than 18/i _1 Mpc 
in the NGP2 and SGP2 samples. From each of those four 



r) values we substract its expected value when uniformity 
is assumed (first number in the parenthesis in table 5) and 
divide the result by the corresponding rms (second num- 
ber in the parenthesis). Each of these quantities follow a 
Gaussian distribution with zero mean and variance 1 under 
the uniformity hipothesis. So, the sum of the four quantities 
must follow a Gaussian with zero mean and variance 4 and 
the probability that these sum be smaller than the actually 
found result is: 



erfc 



y 4 



rtnsCrii) 



2 V2 



0.005 



(22) 



So, we may infer at the 99.5% confidence level the existence 
of non-uniformity on the void distribution, that, given the 
uniformity of the galaxy distribution can only be explained 
by the growth of density fluctuations as redshift decreases. 
Note that the rms(rf) given in table 5 do not account for the 
anticorrelation between the voids centers (when r < 2R). 
The last factor in expression (12 U approximately accounts 
for this fact. If we were to use it, all the rms(rj) would be 
smaller and, consequently, we could reject the uniformity 
hypothesis at a larger confidence level. However, we prefer 
to give the conservative value quoted above since it has not 
any uncertaninty. 



5 REDSHIFT SPACE DISTORTIONS 

Redshift galaxy maps like the 2dFGRS or SDSS are dis- 
torted by the peculiar velocities of galaxies along the line of 
sight. This effect, which produce deformations such as the 
"finger of God", is due to the fact that the distance to the 
survey galaxies is obtained by means of the Hubble distance 
which is obtained with the total velocity (the Hubble flow 
plus the peculiar velocity). 

We present in this section a method to correct this effect 
for the galaxies around voids. We first suppose that if in the 
distorted space we have a void of radius R, we will have 
the same void of radius R in the real space. This is a good 
approximation because the distortion around a void cause 
an elongation along the line of sight of the maximal sphere 
without changing the "transversal" radius, i.e., although the 
distortion is not volume conservative, the maximal radius 
will be approximately the same in both spaces. 

With this assumption, we have in the position 
of the void in the distorted space a mean under- 
density, So, within the maximal sp here with radius R 
JPatiri. Betancort-Riio fc Pradall2004) : 



S = 



S P(So/R)dS 



(23) 



where P(So/R) is the probability distribution for So within 
a maximal sphere of radius R. 

We associate with this void a matter distri- 
bution given by the mean profi le, 8(r/So, R)- In 
IPatiri. Betancort-Riio fc Pradal <l2004l) we derive this 
profiles. So, it has a peculiar velocity profile V(S(r),r) 
given by the spherical collapse model (Betancort-Rijo et al. 
2005): 



V(S(r),r) 
H 



-0.51 



1 + 5 



8i(8)( 



dSijS) , 
dS ' 



(24) 
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Mm n lir>T 




Radius \(h _1 Mnrll 




of voids 


Distorted 


Real 


Corrected 


1 


18.17 


17.48 


17.58 


5 


16.51 


16.19 


15.89 


10 


15.70 


15.09 


14.93 


15 


15.42 


14.57 


14.58 


20 


14.93 


14.36 


14.15 



Table 6. The void statistics found in the 3 catalogs generated 
from the millennium galaxy catalog: the redshift distorted, the 
real and the corrected. The first column show the number of voids 
found with radius larger than the values given in the 3 columns 
on the right corresponding to each catalog. 



where H is the Hubble constant, Si (S) is given in Sheth 
& Tormen (2002) and 8 is the mean profile (note that for 
simplicity we do not show the dependence on r, So and 7?). 
We have derived an analytical approximation for S(r/So, 7?) 
which gives good results for r ^ 1.57?: 



8{r/5 ,R) ~ So + (0.1645 + 0.085<5 )(— ) (25) 
So that, we correct the galaxy "measured" distance r(z): 
(Tt--& e ).-& V{S(r/S ,R),r) 



r re ai ^ r(z) - 



I 



H 



(26) 



where ~5? is the vector to the galaxy and Hf c is the vector to 
the center of the void. We apply this correction to galaxies 
with distance to the center of the void ^ 1.57?. Once we 
have the corrected catalogue, we search for voids over this 
new catalogue. 

In order to test our technique, we have applied it to a 
simulated catalogue, the Millennium Run galaxy catalogue 
(Springel et al 2005). The public available catalogue 1 con- 
tains a total of about 9 million galaxies in the simulation 
box of 500/i~ Mpc. For each galaxy, it is available the po- 
sition and velocity, the total and bulge galaxy magnitudes 
in 5 bands (ugriz SDSS bands), the total and bulge stel- 
lar mass, cold, hot and ejected gas mass, the black hole 
mass and the star formation rate. The dark matter halos in 
the simulation were populated using semi-analytic models 
of galaxy formation (see Croton et al. 2005 for full details). 
We have constructed from the full simulated box a smaller 
one of 250/i Mpc. With this box we have enough volume 
to study large (and rare) voids. Also, as we have available 
in the original catalogue the coordinates in real space, we 
have constructed another box of 250/i -1 Mpc with the same 
galaxies but with the coordinates in redshift space. 

We have applied our HB void finder to search for voids 
larger than 13.0/i~ 1 Mpc in both distorted and real cata- 
logues. With the list of voids in the distorted catalogue, we 
have applied the correction to those galaxies that lie within 
1.4 radius of the voids. After we have obtained the corrected 
galaxy catalogue, we run the void finder over this catalogue. 
In table (6) we show the statistics of voids in the 3 different 
catalogs: the distorted, the real and the corrected. 

From the study of the simulated catalogues we learn 
that, although the correction may be larger than 3/i _1 Mpc 



(for 7? ^ 16/i _1 Mpc) for galaxies close to the line of sight to 
the center of the voids, the correction for the radius of the 
maximal sphere is, as expected, much smaller. Even so, the 
difference is not negligible for sufficiently rare voids. We find 
that the 20 largest voids in the simulated catalogues are on 
average slightly smaller (~ 5%) in the corrected catalogue. 

From the void statistics found in the corrected SGP1 
and NGP1 samples we find that for the corrected SGP1 the 
ten largest voids are on average 0.83fa -1 Mpc smaller than 
for the uncorrected one and that the number of voids larger 
than 12/i _1 Mpc in both samples is 25 for the uncorrected 
case and 14 for the corrected one. So, since the strongest 
constraint on the models comes from relatively rare voids, 
it seems likely that the corrected catalogues must be used 
in order to be able to make accurate inferences. 



6 GALAXIES IN NEARBY VOIDS 

In this section we study the galaxy content of the nearby 
rarest voids in our 2dFGRS samples. To this end, we have 
selected the voids from our SGP1 volume-limited sample 
described in Section 5.1 (see Table 3). The voids are denned 
by galaxies brighter than —19.32 + 5 log/t. We have searched 
for faint galaxies down to M^ m = —18.3 + 5 log/i inside 
nine voids with radius larger than 13/i _1 Mpc in a volume 
limited sample up to z=0.095. These voids are uncommon 
(with mean radius of 14.0/i _1 Mpc) due to the fact that a 
randomly placed sphere with radius equal to the mean size 
of these voids should contain about 50 galaxies brighter than 
— 19.32 + 5 log/i. In total we find 130 faint galaxies inside 
these 9 rare voids, i.e. on average 14 galaxies per void. We 
have estimated the number density contrast of the galaxies 
located inside these voids by 



Sgal 



(27) 



1 It can be downloaded 

http: / / www. mpa-garching. mpg.de / galform/agnpaper / 



from 



where n vo id is the number density of galaxies inside the void 
and n is the number density of galaxies in the field. For the 
galaxies inside the voids we obtain S ga i = —0.87. In fig- 
ure we display a central 6/i _1 Mpc thick slice for three of 
these nine voids. We can see that, despite the fact that these 
voids are highly empty, the faint galaxies populating them 
show interesting structures like filaments. Nevertheless, most 
galaxies are placed close to the borders of the voids being 
their centers much emptier. Note that these galaxy patterns 
are similar to those found in voids in high resolution numer- 
ical simulations by Gottober et al. 2003. In figure|S]we show 
the mean number density of faint galaxies in our rare voids 
as a function of distance to the void center (normalized to 
the void radius). 

We have also studied the galaxies inside common voids, 
i.e. voids whose underdensities are not too big. We have se- 
lected from the SGP1 sample the voids defined by galaxies 
brighter than —20.5 + 5 log/i. We found five voids larger than 
15.07i _1 Mpc up to z=0.095. For these radius, a randomly 
chosen sphere should contain only 3 galaxies brighter than 
—20.5 + 5 log/i. In total there are 666 galaxies fainter than 
-20.5 + 5 log/i down to M^ m = -18.3 + 5 log/i. These voids 
contain on average 10 times more galaxies than the rare 
voids discussed above, being the number density contrast of 
galaxies S ga i = —0.54. In figure |7| we show an example of 
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Figure 6. Averaged enclosed number density profile of faint 
galaxies (-19.32 + 5 log/i < M b] < -18.3 + 5 log/i) in nearby 
voids as a function of the distance from the center of the void (in 
void radius units). The mean number density of galaxies in the 
field for these magnitude band is 7.0 X 10 -3 (/i _1 Mpc) -3 . 



Figure 7. Galaxies inside a common void. We show a central 
Sh~ Mpc thick slice of a 18.25/i — *Mpc void denned by galaxies 
brighter than —20.5 + 5 log/i (filled circles). The galaxies inside 
the void are fainter down to Af' lm = —18.3 + 5 logh (open circles). 



a 8ft _1 Mpc thick slice of a 18.25/i _1 Mpc void in this sam- 
ple. The galaxies inside this void almost fully fill the void 
(open circles). This is not surprising due to the fact that the 
density contrast of these voids is not too big. 



7 DISCUSSION & CONCLUSIONS 

We have developed two robust and accurate algorithms to 
detect non-overlapping maximal spheres in halo or galaxy 
samples. We have applied them to several numerical simu- 
lations boxes in order to study the performance of the algo- 
rithms. The results found were very satisfactory. 

We have then studied the void statistics in the 2dF- 
GRS. We have detected ~ 350 voids with radius larger than 
7.5/lT 1 Mpc defined by galaxies with M b , < -19.32 + 51og/i 
and ~ 70 voids with radius larger than 13.0/i _1 Mpc defined 
by galaxies with Mb 3 < — 20.5 + 51og/i in the volume limited 
samples up to z max = 0.14. We have also obtained the void 
statistics for the volume limited samples up to z max = 0.2. 
For this case, we have detected ~ 170 voids with radius 
larger than 13/i -1 Mpc defined by galaxies brighter than 
M bj < -20.181 + 51og/i. 

The number density of voids larger than R found in the 
SGP and NGP samples are in good agreement with each 
other for all values of R. We have obtained the VPF for both 
strips finding results in good agreement w ith previous ones 
JCroton et al.l2004l:lHovle fc Vogelevl2004l) . We have shown, 
using an appropriate expression for the VPF sampling er- 
rors, that the results found in both strips are statistically 
compatible. 

From the results of several statistical tests we conclude 
that except for the anticorrelation implicit in the fact that 



the maximal sphere are chosen so that they do not overlap, 
they are essentially uncorrelated. There is, however, some 
evidence for a weak additional anticorrelation, which may be 
easely explained within the standard scenario of structure 
formation. We conclude at least at 99.5% confidence level 
that there is a dependence of the void number density on 
redshift. We do this by means of a modified version of the 
Vmax test which reveals a small trend towards small z values. 

We have also obtain preliminary results on the galaxy 
contents of nearby voids found in volume-limited galaxy 
samples in the 2dFGRS. For the nine nearby voids up to 
2 < 0.095 in our sample, we have found inside them on 
average only 15 galaxies fainter than —19.32 + 5 logfo (the 
magnitude of the galaxies which define the voids). These 
voids are rather empty compare to those defined by brighter 
galaxies. The galaxies within the voids are not randomly 
distributed: they are clustered forming well define filamen- 
tary structures as that observed in the large-scale structure 
of the galaxy distribution of the Universe. Moreover, this 
is the same pattern that show the dark matter halo distri- 
bution found inside voids in high resolution N-body sim- 
ulations. The halos inside voids are distributed in a way 
that resemble a minia ture of the Universe (see Figure 2 in 
IGottlober et alJ <l2003l) ). We have also obtained the number 
density profile for the galaxies inside the voids. The num- 
ber density of faint galaxies fall almost a factor 7 from the 
border of the voids to the inner half. Moreover, the num- 
ber density of galaxies close of the borders of the voids are 
still too low compared wit h the field (almost a factor 5). 
IGottlober et alJ (120031) and lPatiri. Betancort-Riio fc Pradal 
l2004F liave obtained the number density profile for halos in 
voids. From the comparison of these results we can see that 
even though the number density of galaxies and halos follow 
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1 WWW X [Kpc/h] « [Mpc/h] 

Figure 5. Three examples of faint galaxies in voids. We show for each void a 6h~ x Mpc central slab. In the left panel is shown a 
13.5/i — 1 Mpc void, in the central pannel a 14.0/i — x Mpc void and in the right pannel a I4.7h~ 1 Mpc void. The filled circles are the galaxies 
brighter than —19.32+5 logfc that define the voids. The open circles are the fainter galaxies inside the voids down to Af^ lm = —18.3+5 log/i. 
Note that even though there is not a large population of faint galaxies in these voids, the galaxies are not randomly distributed, i.e. they 
are distributed in filamentry structures similar to those found in another scales (see central and right panel). 



similar trends, the galaxy profile is steeper than halos (they 
fall just a factor 2). 

In a future work, we will use the frame work devel- 
oped in IPatiri. Betancort-Riio fc Pradal J2004T) along with 
the results given here for the number density of voids 
and their redshift dependence in order to constrain the 
value of as- Furthermore, using a halo occupation model 
(e.g. Berlind et al. 2003) alo n g the lines described in 
IPatiri. Betancort-Riio fc Pradal {2004) we shall establish 
constraints in the relationship between halos and galaxies 
from void statistics and the statistics of the galaxies inside 
the voids. In particular, we hope to be able to determine 
whether or not the conditional luminosity function depends 
on environment. 

In the other hand, the study of the physical properties 
of the galaxies inside voids could imply constraints on the 
galaxy formation processes. So, we will analyze the physical 
properties like colors, metalicities, star formation rates, etc. 
of galaxies in rare voids that are available from the biggest 
galaxy surveys (2dFGRS and SDSS) in order to test, for 
example, the galaxy luminosity function in voids and the 
density-morphology correlations. 
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APPENDIX A: VOID FINDER ALGORITHMS 

Al Algorithm 1: Cell Void Finder 

The first step is to select the sample of galaxies or dark mat- 
ter halos in our redshift survey or cosmological simulation 
that will define the voids. We will select galaxies with lumi- 
nosity greater than a given value L or halos with virial mass 
greater than M . 

Once the galaxy or halo sample has been selected our 
code generates a cubic mesh where the cell size determines 
the working resolution. Afterwards, the objects are assigned 
to cells. So we have three types of cells: the cells that contain 
galaxies or halos (filled cells), the cells that are empty but 
are inside the observational domain (observed empty cells) 
and those cells which are also empty but they are located 
outside the observational domain or in a not observed re- 
gion inside the observational domain (a 'hole') (not observed 
empty cells). The code searches for the voids among the ob- 
served empty cells inside the observational domain. In the 
case of cosmological simulations the dark matter halo sam- 
ples are in 3D boxes and generally all the cells are inside the 
observational domain. However, in the case of galaxy sam- 
ples in redshift surveys which have irregular geometry, some 
of the cells are located outside the observational domain. 
We can easily determine which cell has been observed and 
which has not by using the survey masks. 




Figure Al. An example of labeling of cells for our Cell Void 
Finder. In order to save computational time, we mark the empty 
cells (dark grey cells) that are neighbours of those cells that con- 
tain objects (black cells). Note that this is an iterative process, i.e. 
we can now mark the empty cells that are neighbours of already 
marked empty cells (light grey cells). The stop of this iteration 
depends on the adopted resolution. The center of the void will be 
searched over the empty unmarked cells (white cells). 



Once the cell classification procedure have been com- 
pleted the code is then ready to search for the voids, finding 
their centers and radii. In principle, each observed empty 
cell could be a potential void center, but it is easy to re- 
alize that the observed empty cells which are located close 
to filled cells that contain galaxies or halos will not be the 
center of a void. So, in order to save computational time, 
we mark these neighbours cells and they will not be taken 
into account at the time of searching for the void centers. 
This is an iterative process, i.e. once we have marked the 
observed empty cells closer to the filled cells we can go to 
the next level and mark the observed empty cells that are 
neighbours of already marked empty cells. Note that we will 
stop this iteration depending on the working resolution (see 
Figure Al). 

To determine the void centers, the code computes the 
distances between each of the unmarked observed empty cell 
and all the galaxies or halos in the whole observational do- 
main and we retain the minimum distance. Once we have 
the list with the minimum distances, we search for the local 
maxima which corresponds to the void centers. Obviously, 
the voids radius are those maximum distances. 

Finally, the code removes the overlapping maximal 
spheres, keeping the biggest one, i.e. if the distance between 
two maximal spheres is less than the sum of their radius, 
then the voids overlap and we remove the smaller one. 

The main advantage of the Cell Void Finder algorithm 
is that in only one run we get all the voids in the sample. 
However, its main disadvantage is that it consumes quite 
a lot of memory, which scale with the resolution that we 
require. There is a similar memory problem when we have a 
large number of galaxies or halos in the sample. If we are only 
interested in the biggest voids (rare voids) this algorithm 
is not the best strategy. Some studies are focused in these 
kind of voids, so, here we have developed a complementary 
algorithm which is more efficient in this respect. 
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A2 Algorithm 2: HB Void Finder 

Here we give the details of our second algorithm whose main 
task is the detection of rare voids and we will use it as a 
complement of the CELLS algorithm. The HB Void Finder 
is conceptually simple. It searches for the non-overlapping 
maximal spheres with radius larger than a given value. As we 
mentioned above this code is designed for statistical studies 
focused on the biggest voids. In these cases the code is very 
accurate and computationally efficient as compared with our 
Cell Void Finder. 

The first step in the algorithm is to generate a sample of 
random trial spheres with a fixed radius R. These spheres are 
generated directly over the observational domain with the 
condition that the entire sphere lies inside the observational 
domain. We check which spheres contains no objects and 
keep them. 

In the next step we find for each trial empty sphere 
the four nearest object and we expand the sphere to con- 
tain them in its surface. These new spheres are potentially 
maximal (see subsection A4). Note that the new expanded 
spheres could in principle contain objects. If this is the case, 
those spheres are removed. The potential maximal spheres 
will be actual maximal spheres if they do not overlap, which 
is decided by means of the same criteria as for the Cell Void 
Finder (i.e. if the distance between two spheres is less than 
the sum of their radius they overlap, so we keep the largest 
one) and if the four nearest objects are not located in the 
same hemisphere (see Figure A2). We put in both algorithms 
the additional constrain that the maximal spheres have to 
be entirely inside the observational domain. Note that, wc 
need to do many realizations of the trial spheres in order 
to get the maximal spheres. Typically, four realizations are 
needed for each void in order to reach its maximum radius, 
when the maximal sphere is only slightly larger than R and 
an increasing number as the maximal sphere is larger with 
respect to R. 



A3 Performance test 

We construct random samples of objects in order to test 
the code performance and check the results of both algo- 
rithms. We generate two samples, one with 1,000 points and 
another one with 10,000 points. Both samples are in a box 
of 100 Mpc. We give in Table Al the void statistics com- 
puted from both algorithms. Note that the agreement be- 
tween codes is excellent. We obtain the void statistics using 
~ 10 7 trial spheres with the HB Void Finder. The resolution 
of the Cell Void Finder is 0.5 Mpc. 

The cpu time of the codes mainly depends on the num- 
ber of particles, the number and radius of trial spheres in 
the case of the HB algorithm and on the number of cells (i.e. 
resolution) and the levels of neighbours cell marking for the 
Cell Void Finder. For example, in the case of the sample with 
1,000 random particles and for the voids with radius larger 
than 10 Mpc, the HB Void Finder takes ~ 1 hour for 10 7 
trials, while the Cell Void Finder takes ~ 3 hours with a res- 
olution of 0.5 Mpc. Notice that the voids with radius larger 
than 10 Mpc are common in this box, so the running time 
differences are not so big between both algorithms. How- 
ever, if we search for voids with radius larger than 12 Mpc, 
the HB Void Finder takes only 20 minutes, while the Cell 



Table Al. Voids in random samples. We list the statistics of 
the maximal spheres found by our algorithms in the sample of 
1,000 and 1,0000 random points in a Box of 100 Mpc. N HB is the 
number of voids found with the HB algorithm with radius larger 
than the value given in the first column. Nqells is the same 
but for the voids found with the Cells Void Finder. See text for 
details. 



Sample 


Box: 


100 Mpc 


1000 






Radius 


N H b 


NcELLS 


Mpc 






10.0 


23 


25 


11.0 


15 


15 


12.0 


7 


7 


10000 


5.5 


69 


73 


6.0 


20 


21 


6.5 


3 


3 


7.0 


1 


1 



Void Finder last the same ~ 3 hours. These tests were done 
in a Pentium IV processor (3.06 GHz clock and 2GB RAM) 
and in a Itanium-2 processor (1.5 GHz clock, 2GB RAM) 
giving both similar performances. 

A4 How to grow the trial spheres 

To determine the sphere passing through the four nearest ob- 
jects to an empty trial sphere we proceed as follow: We first 
take the two nearest objects (whose coordinates we denote 
as "afi and it 2) and calculate the middle point (~af i+lr*2)/2. 
From this point, we move along a vector in the plane contain- 
ing the three nearest objects and perpendicular to ~S?2 — afi 
until we reach the point, q, where the distances to the third 
nearest object (~a?3) is the same as that to object 1, then 
the distance between object 2 and if is also the same that 
the previous two (See figure A3). Then we need to solve: 



Xl 



If (itfo)| = 1^3 - ~<f (wo)| 



where 

1 ( w ) = n l-WJ 



and 



~?i3 — ("e > i3-"e > i2)"e > i2 
"5*13 — (~e > i3-~e > i2)"e > i2| 



where 



e 12 : 



"afi — ~a? 2 
|~a?i - ^2 



■; e 13 



a?i — 2*3 
a? 3 | 



(Al) 



(A2) 



(A3) 



(A4) 



with w G (— 2Ro, 2Ro), Ro is the radius of the trial sphere. 

Now, we repeat the same procedure described above 
but taking into account the fourth object, i.e. we move from 
~q*(u>o) perpendicularly to the plane of the figure Al until 
we reach the point ~P(t) where the distance between "a?4 and 
~P(t) is the same as the distance from "afi to ~P(t). So, 



~P(t) = -?(w )+tTt 
where 



(A5) 
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[t] 

Figure A2. Sequence of steps in our HB Void Finder: a) we throw a sphere of fixed radius (dotted circle) over the observed sample 
of objects, b) if this sphere is empty of objects defining the void (filled dots), we then grow the sphere (dashed circle) until its surface 
contains the four nearest objects (three in 2-D, see Appendix A). This is a potential maximal sphere. We throw another random sphere 
(dotted circle). Again, if this one is empty, we obtain the potential maximal sphere (dashed circle in panel c). d) Finally, we check if 
the new maximal sphere overlaps with another one; if so, we keep the largest sphere. This is our void (filled circle). See text for details. 
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Figure A3. 






ft = lfl2 A !?13 

Solving 

~&i - ~P(t)\ = |^4 






(A6) 
(A7) 



with t G (— 2Rq, 2Ro), we finally obtain the coordinates of 
the center of the maximal sphere, ~P{t), and its radius R, 
which is simply given by |~a?i —~P(t)\. 
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